43 research outputs found

    On-line PCA with Optimal Regrets

    We carefully investigate the on-line version of PCA, where in each trial a learning algorithm plays a k-dimensional subspace and suffers the compression loss incurred when the next instance is projected onto the chosen subspace. In this setting, we analyze two popular on-line algorithms, Gradient Descent (GD) and Exponentiated Gradient (EG). We show that both algorithms are essentially optimal in the worst case. This comes as a surprise, since EG is known to perform sub-optimally when the instances are sparse. This different behavior of EG for PCA is mainly related to the non-negativity of the loss in this case, which makes the PCA setting qualitatively different from other settings studied in the literature. Furthermore, we show that when considering regret bounds as a function of a loss budget, EG remains optimal and strictly outperforms GD. Next, we study an extension of the PCA setting, in which Nature is allowed to play dense instances, i.e. positive matrices with bounded largest eigenvalue. Again we show that EG is optimal and strictly better than GD in this setting.
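    To make the setting concrete, here is a minimal sketch of the compression loss of an instance x against a k-dimensional subspace with orthonormal basis U, together with a plain gradient step on U. This is our own simplification for illustration (the paper's GD operates on a matrix parameter with capped eigenvalues, not on a basis directly); all names and the step size are assumptions.

        import numpy as np

        def compression_loss(U, x):
            # Loss of instance x when projected onto the k-dim subspace
            # spanned by the orthonormal columns of U: ||x - U U^T x||^2.
            r = x - U @ (U.T @ x)
            return float(r @ r)

        def gd_step(U, x, eta=0.1):
            # Illustrative gradient step (not the paper's exact update):
            # the gradient of ||x - U U^T x||^2 w.r.t. U is -2 (x x^T) U,
            # so descending moves the subspace toward the new instance.
            U = U - eta * (-2.0 * np.outer(x, x) @ U)
            Q, _ = np.linalg.qr(U)  # re-orthonormalize the basis
            return Q

    EG, by contrast, would update a density-matrix-like parameter multiplicatively (via a matrix exponential) rather than by adding a gradient term.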

    The perceptron algorithm versus winnow: linear versus logarithmic mistake bounds when few input variables are relevant

    We give an adversary strategy that forces the Perceptron algorithm to make Ω(kN) mistakes when learning monotone disjunctions over N variables with at most k literals. In contrast, Littlestone's algorithm Winnow makes at most O(k log N) mistakes on the same problem. Both algorithms use thresholded linear functions as their hypotheses. However, Winnow does multiplicative updates to its weight vector instead of the additive updates of the Perceptron algorithm. In general, we call an algorithm additive if its weight vector is always a sum of a fixed initial weight vector and some linear combination of already seen instances. Thus, the Perceptron algorithm is an example of an additive algorithm. We show that an adversary can force any additive algorithm to make (N + k − 1)/2 mistakes when learning a monotone disjunction of at most k literals. Simple experiments show that for k ≪ N, Winnow clearly outperforms the Perceptron algorithm also on nonadversarial random data.
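    To make the additive/multiplicative contrast concrete, here is a minimal sketch of the two update rules for {0, 1}-valued instances. The promotion factor alpha = 2 and threshold theta = N are the standard textbook choices for Winnow, not necessarily the exact parameters of the paper; the function names are ours.

        import numpy as np

        def perceptron_update(w, x, y, yhat):
            # Additive: on a mistake, add or subtract the instance itself,
            # so w stays a linear combination of already seen instances.
            if yhat != y:
                w = w + (1.0 if y else -1.0) * x
            return w

        def winnow_update(w, x, y, yhat, alpha=2.0):
            # Multiplicative: on a mistake, rescale only the weights of
            # the active variables (x_i = 1): up on false negatives,
            # down on false positives.
            if yhat != y:
                w = w.copy()
                w[x == 1] *= alpha if y else 1.0 / alpha
            return w

        # Both algorithms predict with a thresholded linear function:
        #   yhat = (w @ x >= theta)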

    Predicting {0, 1}-Functions on Randomly Drawn Points

    We consider the problem of predicting {0, 1}-valued functions on R^n and smaller domains, based on their values at randomly drawn points. Our model is related to Valiant's PAC learning model, but does not require the hypotheses used for prediction to be represented in any specified form. In our main result we show how to construct prediction strategies that are optimal to within a constant factor for any reasonable class F of target functions. This result is based on new combinatorial results about classes of functions of finite VC dimension. We also discuss more computationally efficient algorithms for predicting indicator functions of axis-parallel rectangles, more general intersection-closed concept classes, and halfspaces in R^n. These are also optimal to within a constant factor. Finally, we compare the general performance of prediction strategies derived by our method to that of strategies derived from methods in PAC learning theory.
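    For the axis-parallel rectangles mentioned above, the standard strategy for intersection-closed classes predicts with the closure (the smallest box) of the positive examples seen so far. A minimal sketch of that predictor follows; the class and method names are our own choices, not the paper's.

        import numpy as np

        class BoxPredictor:
            # Predict with the smallest axis-parallel rectangle that
            # contains all positive examples seen so far; everything
            # outside it is predicted negative. Before any positive
            # example, the box is empty and every prediction is negative.

            def __init__(self, n):
                self.lo = np.full(n, np.inf)
                self.hi = np.full(n, -np.inf)

            def predict(self, x):
                return bool(np.all(self.lo <= x) and np.all(x <= self.hi))

            def update(self, x, label):
                if label:  # grow the box just enough to cover x
                    self.lo = np.minimum(self.lo, x)
                    self.hi = np.maximum(self.hi, x)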

    When Is There a Free Matrix Lunch?

    The Weighted Majority Algorithm

    Composite Geometric Concepts and Polynomial Predictability

    Symmetry enhanced AdaBoost

    This paper describes a method to reduce the immense training time of the conventional AdaBoost learning algorithm in object detection by shrinking the sampling area. A new algorithm that exploits the geometric, and in particular the symmetric, relations of the analyzed object is presented. Symmetry enhanced AdaBoost (SEAdaboost) can limit the scanning area substantially, depending on the degree of the object's symmetry, while maintaining the detection rate. SEAdaboost takes advantage of the symmetric characteristics of an object by concentrating on corresponding symmetry features during the selection of weak classifiers. In our experiments we obtain a 39% reduction in training time (on average) with slightly increased detection rates (up to 2.4% and up to 6%, depending on the object class) compared to the conventional AdaBoost algorithm.
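    As an illustration of how symmetry can shrink the sampling area, the sketch below draws candidate rectangular feature positions only from the left half of the detection window and mirrors each across the vertical axis. This is our own reading of the idea, not SEAdaboost's actual implementation; every name and the exact mirroring rule are assumptions.

        import numpy as np

        def sample_mirrored_features(win_w, win_h, n_features, seed=0):
            # Draw (x, y, w, h) rectangles from the left half of the
            # window only, and pair each with its mirror image in the
            # right half, roughly halving the area scanned for a
            # vertically symmetric object.
            rng = np.random.default_rng(seed)
            feats = []
            for _ in range(n_features):
                w = int(rng.integers(1, win_w // 2 + 1))
                h = int(rng.integers(1, win_h + 1))
                x = int(rng.integers(0, win_w // 2 - w + 1))
                y = int(rng.integers(0, win_h - h + 1))
                feats.append(((x, y, w, h), (win_w - x - w, y, w, h)))
            return feats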